feat: use `partman` and s3 for archival partitions #610

maschad · 2025-05-07T23:31:16Z

As it stands

docker exec atoma-node-postgres-db-1 psql -U atoma -c "SELECT pg_size_pretty(pg_total_relation_size('stacks')) as size, pg_size_pretty(pg_relation_size('stacks')) as table_size, pg_size_pretty(pg_total_relation_size('stacks') - pg_relation_size('stacks')) as index_size, reltuples::bigint as estimated_rows FROM pg_class WHERE relname = 'stacks';"
  size   | table_size | index_size | estimated_rows 
---------+------------+------------+----------------
 7314 GB | 584 kB     | 7314 GB    |           1108
(1 row)

reveals

• Total Size: 7314 GB (approximately 7.3 TB)
• Table Size: 584 kB (actual data)
• Index Size: 7314 GB (indexes)
• Estimated Row Count: 1,108 rows

Which shows our indexes are excessively large, whilst this will partly be resolved by #609 to stem the growth, we will still need to clear any bloat that may be stored in TOAST.

This means will need a back up of records as well as going forward once fiat payments are integrated we will also need to store user stack related for the foreseeable future, which means it will need to be archived at intervals to avoid deletion.

This integrates https://github.com/pgpartman/pg_partman to manage archiving via s3

Copilot

Pull Request Overview

This PR introduces database maintenance enhancements by integrating pg_partman for partition management and S3 for archival and restoration of data. Key changes include:

New scripts for database maintenance operations (including partition initialization, archival to S3, and restoration).
Configuration updates in docker-compose and Ofelia for scheduling maintenance tasks.
Prometheus monitoring integration for the archival process.

Reviewed Changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
scripts/db-maintenance/run-maintenance.sh	Adds a script to execute maintenance tasks, including installing extensions and running SQL scripts.
scripts/db-maintenance/restore-from-s3.sql	Implements a PL/pgSQL function to restore data from S3.
scripts/db-maintenance/init-partioning.sql	Provides partitioning initialization using pg_partman; note the operator spacing issue.
scripts/db-maintenance/archive-partitions.sql	Creates functions and configurations for archiving partitions to S3.
scripts/db-maintenance/.env.example	Supplies environment variables required for S3 configuration.
prometheus.yml	Configures a new Prometheus job for monitoring the archival process.
ofelia.ini	Defines a scheduled job for monthly archive operations.
docker-compose.yaml	Updates service definitions to include maintenance tasks and ensure proper volume mappings.

Copilot · 2025-05-07T23:39:14Z

scripts/db-maintenance/init-partioning.sql

+	p_template_table = > NULL,
+	p_retention = > '36 months',
+	p_start_partition = > date_trunc('month', CURRENT_DATE - interval '3 years') :: text


There is an extra space in the parameter assignment ('= >' instead of '=>'). This syntax error could prevent the partition maintenance function from executing correctly. Please update to use '=>' consistently.

Suggested change

p_template_table = > NULL,

p_retention = > '36 months',

p_start_partition = > date_trunc('month', CURRENT_DATE - interval '3 years') :: text

p_template_table => NULL,

p_retention => '36 months',

p_start_partition => date_trunc('month', CURRENT_DATE - interval '3 years') :: text

Copilot · 2025-05-07T23:39:14Z

scripts/db-maintenance/init-partioning.sql

+	p_template_table = > NULL,
+	p_retention = > '36 months',
+	p_start_partition = > date_trunc('month', CURRENT_DATE - interval '3 years') :: text


Please remove the extra space in the parameter operator; replace ' = >' with '=>' as required by the function's syntax.

Suggested change

p_template_table = > NULL,

p_retention = > '36 months',

p_start_partition = > date_trunc('month', CURRENT_DATE - interval '3 years') :: text

p_template_table => NULL,

p_retention => '36 months',

p_start_partition => date_trunc('month', CURRENT_DATE - interval '3 years') :: text

docker-compose.yaml

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

feat: use partman and s3 for migrations

03e3534

maschad requested review from Copilot and jorgeantonio21 and removed request for jorgeantonio21 May 7, 2025 23:38

maschad self-assigned this May 7, 2025

maschad added atoma-service deployment labels May 7, 2025

Copilot AI reviewed May 7, 2025

View reviewed changes

maschad changed the title ~~feat: use partman and s3 for migrations~~ feat: use partman and s3 for archival partitions May 7, 2025

maschad and others added 2 commits May 7, 2025 18:45

Update docker-compose.yaml

e54e971

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

chore: archive data after 1 month

29d50f1

maschad mentioned this pull request May 7, 2025

chore: use partman for archiving AtomaAI/atoma-proxy#459

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: use `partman` and s3 for archival partitions #610

feat: use `partman` and s3 for archival partitions #610

Uh oh!

maschad commented May 7, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI May 7, 2025

Uh oh!

Copilot AI May 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: use partman and s3 for archival partitions #610

Are you sure you want to change the base?

feat: use partman and s3 for archival partitions #610

Uh oh!

Conversation

maschad commented May 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI May 7, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: use `partman` and s3 for archival partitions #610

feat: use `partman` and s3 for archival partitions #610

maschad commented May 7, 2025 •

edited

Loading